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Abstract 

We consider multi-antenna cooperative spectrum sensing in cognitive radio networks, when there 
may be multiple primary users. A detector based on the spherical test is analyzed in such a scenario. 
Based on the moments of the distributions involved, simple and accurate analytical formulae for the 
key performance metrics of the detector are derived. The false alarm and the detection probabilities, 
as well as the detection threshold and Receiver Operation Characteristics are available in closed form. 
Simulations are provided to verify the accuracy of the derived results, and to compare with other 
detectors in realistic sensing scenarios. 

Index Terms 

Cognitive radio; spectrum sensing; multiple primary users; the spherical test. 

I. Introduction 

Cognitive radio (CR) is a promising technique for future wireless communication systems. 
In CR networks, dynamic spectrum access is implemented to mitigate spectrum scarcity. A 
secondary (unlicensed) user is allowed to utilize the spectrum resources when it does not cause 
intolerable interference to the primary (licensed) user. A key requirement for this is the secondary 
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user's ability to detect the presence of the primary user. Thus spectrum sensing is considered as 
a key component in CR networks. 

Prior work on cooperative spectrum sensing predominately employ the assumption of a single 
active primary user. Based on this assumption, several eigenvalue based sensing algorithms have 
been proposed recently fl], CO, [0, 0, 0, Q, JH. These algorithms are non-parametric, 
i.e. they do not require information of the primary user, in contrast to e.g. feature detection. Also, 
they achieve optimality under different assumptions on the knowledge of the parameters. The 
assumption of a single primary user is made as the investigations in the literature have mainly 
focussed on CR networks, where the primary users are TV or DVB systems. In these systems 
the single active primary user assumption is, to some extent, justifiable. In addition, assuming a 
single primary user leads to analytically tractable problems. 

The single primary user assumption may fail to reflect the situation in forthcoming CR 
networks, where the primary system could be a cellular network, and the existence of more than 
one primary user would be the prevailing condition. Using existing single primary user detection 
algorithms in such a scenario will induce performance loss. Despite the need to understand 
multiple primary user detection, the results in this direction are rather limited. A heuristic 
detection algorithm based on the ratio of the extreme eigenvalues is investigated in [0, IfTOl . 
[fTT|. but its detection performance turns out to be sub-optimal [8 J. Recently, a novel detection 
algorithm in the presence of multiple primary users, based on the spherical test, has been proposed 
in [fT2ll . However, no analytical results pertaining to its statistical performance were presented. 
In this paper we analytically investigate the detection performance by deriving closed-form 
approximations for the test statistics distributions under both hypothesis. These approximations 
are obtained by matching the moments of the test statistics to the Beta distribution. Using the 
derived results we obtain analytical formulae for major performance measures, such as the false 
alarm probability, the detection probability, the decision threshold and the Receiver Operating 
Characteristic (ROC). The derived approximations are easily computable and simulations show 
that they are accurate for the considered sensor sizes, number of samples, the assumed number 
of primary users and corresponding SNRs. In addition, for the most useful system configuration 
of two sensors with arbitrary number of samples a simple form of the exact detection probability 
is derived. 

The rest of this paper is organized as follows. In Section [D] we study the test statistics for 
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the multiple primary user detection after outlining the signal model. Performance analysis of the 



chosen detection algorithm is addressed in Section |nij Section [TV] presents numerical examples 
to examine the detection performance in diverse scenarios. Finally in Section [V] we conclude 
the main results of this paper and point out some possible future research directions based on 
the results of this work. 

II. Problem Formulation 

A. Signal Model 

Consider the standard model for K-sensor cooperative detection in the presence of P primary 
users, 

x = Hs + an (1) 

where x G C A is the received data vector. The K sensors may be e.g. K receive antennas in 
one secondary terminal or K secondary devices each with a single antenna, or any combination 
of these. The K x P matrix H = [hi, . . . , hp] represents the channels between the P primary 
users and the K sensors. The P x 1 vector s = [si, . . . ,sp}' denotes zero mean transmitted 
signals from the primary users. The K X 1 vector cm is the complex Gaussian noise with zero 
mean and covariance matrix ct 2 1k, where the scalar o 2 is the noise power. 

We collect iV i.i.d observations from model ([I]) to a K x N matrix X = [xi, . . . ,xjy]. The 
problem of interest is to use the data matrix X to decide whether there are primary users [j] For 
ease of analysis we make the following assumptions 

1) The channel H is constant during sensing time. 

2) The primary user's signal follows an i.i.d zero mean Gaussian distribution and is uncor- 
rected with the noise. 

Due to the first assumption the channel model for H may not need to be specified. In the 
absence of primary users, the sample covariance matrix R = XX^ follows an uncorrelated 
(white) complex Wishart distribution Wk {N, S) with population covariance matrix 

£ :=K[XX.^]/N = a 2 l K . (2) 

'This collaborative sensing scenario is more relevant when the K sensors are in one device, since for multiple collaborating 
devices, accurate time synchronization between devices are needed and communications to the fusion center becomes an issue. 
Typically, K is less than eight due to physical constraints of the device size. 
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In the presence of primary users, by the two assumptions above, the sample covariance matrix 
R follows a correlated complex Wishart distribution. The correlation is given by the presence 
of the signals with the covariance matrix equals 

p 

£ = a 2 I^ + ^ 7 ,h 4 h4, (3) 

i=i 

where 7, := E[sjsJ] defines the transmission power of the i-th primary user. The received Signal 
to Noise Ratio (SNR) of primary user i across the K sensors is 

7»l I h,- 1 1 2 

SNRj := hn . (4) 

Finally, we denote the ordered eigenvalues of the sample covariance matrix R by Ai > A 2 > 
... > \ K . 

B. Test Statistics 

The differences between the population covariance matrices ([2]) and can be explored to 
detect the primary user. This detection problem can be formulated as a binary hypothesis test, 
where hypothesis Ho denotes the absence of primary users and hypothesis Hi denotes the 
presence of primary users. Declaring wrongly Ho, or declaring correctly Hi, defines the false 
alarm probability P fa , and the detection probability P d , respectively. Since the sample covariance 
matrix R is a Wishart matrix, it is sufficient statistics for the population covariance matrix 
£ IfTBI . This leads to various test statistics as functions of R with different assumptions on the 
number of primary users P, and the knowledge of the noise power a 2 . 

In the case of a single primary user (P = 1) the hypothesis test can be expressed as 

H : S = a 2 I x (5) 
Hi : S = ( T 2 I x + 7l h 1 h t 1 . (6) 

Assuming known noise power a 2 the Largest Eigenvalue based (LE) detection (T LE = Ai) 
is shown to be optimal under the Generalized Likelihood Ratio Test (GLRT) criterion (3]|. 
Performance analysis of the LE detector can be found, e.g., in JU, 0, flU. Assuming 
unknown noise power, the optimal detector in the GLRT sense is the Scaled Largest Eigenvalue 
based (SLE) detection (T SLE = Ai/tr(R)) [H, Q, d. 
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In the presence of multiple primary users (P > 2), neither LE nor SLE detection are optimal 
and no uniformly most powerful test exists in this setting [|T4ll . To formulate a hypothesis 
test in this setting, one needs to consider the fact that for a secondary user the most critical 
information is whether or not there are active primary users. The knowledge of the number of 
active primary users may not be relevant from the secondary user's perspective. With the above 
considerations and also the fact that Ylf=i 7*h«h! * s a positive definite matrix, we choose the 
following hypothesis test in the multiple primary users scenario 

Ho : £ = <7 2 I* (7) 
Hi : Vya 2 I K , (8) 

where the noise power a 2 is assumed to be unknown and A y B denotes that A — B is a 
positive definite matrix. Essentially, we are now testing a null hypothesis £ = a 2 lx against 
all the other possible alternatives of S. Thus no a priory assumption on the structure of £ is 
required, except for its positive definiteness. In particular, deciding the number of primary users 
P is not needed, i.e., the hypothesis test is blind in P. Intuitively, this test is to reject "Ho if 
we have reason to believe that the population covariance matrix £ departs from the sphericity 
£ = a 2 I K . 

In the statistics literature, this hypothesis test is known as the sphericity test, which was first 
studied in [fT5l . Comprehensive results of the sphericity test for the real Wishart matrix R, 
including asymptotic distributions, can be found in [fT4l . The test statistics of this Spherical Test 
based (ST) detector was derived under the GLRT criterion as lfT5l 

det(R) r£,A, (9) 

For completeness, the essential steps of the derivation are outlined here. Apart from a constant, 
the likelihood function of the data matrix X is 

L(X|S) = (det(£)rV r( - s ~ lR) . (10) 

The likelihood ratio statistics is 

. = sup a2>0 L(X|a 2 I x ) 
su Ps^o L ( x l s ) 
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The maximum likelihood estimates of a 1 under Ho and £ under Hi are Ifl3lh 

fi = m, e = 5 (12) 

KN N V ' 



respectively. Inserting these into (11) we obtain 



= dfW := rsx , (13) 
(*MR)) 

Hypothesis "Ho is rejected if p is small i.e. when p l l N is small. Thus if T ST is greater than some 
threshold (, the detector declares Ho, otherwise Hi'. 

Ho 

T ST ^ C- (14) 

Wi 



Recently the spherical test is formulated in 111211 as a spectrum sensing algorithm. However the 
detection performance analysis in [fT2| relies on simulations only. We will address this analytically 
in the next section. Besides the ST detector, other competing detectors in the presence of multiple 
primary users include the Eigenvalue Ratio based (ER) detection (T ER = Xi/Xr) S, ifTOll . ifTTTl 
and John's detection ^Tj = Yn=i \7 (E£i ffUl, El, d. In the scenarios simulated 

in Section [TV] the ST detector outperforms the ER detector in the presence of multiple primary 



users and, when P is large, also John's detector. 

III. Performance Analysis 

In this section, we derive some closed-form performance metrics for the spherical test based 
detection. In the sequel, we present analytical formulae for its false alarm probability, detection 
probability, decision threshold and receiver operating characteristics. 

A. False Alarm Probability 

Define the Cumulative Distribution Function (CDF) of the random variable T ST under Ho by 
Fsr(y). Since Pf a relies on F ST (?/), we start by investigating the characteristics of Fsj(y). For 
the case of two sensors K = 2 and three sensors K = 3 with arbitrary sample size N, the exact 
PDFs of T ST under H can be found, e.g., in HI (Eq. (3.8)) and El (Eq. (ii) of Corollary 2.1) 
respectively. By definition, the CDFs for K = 2 and K = 3 can be obtained as 

, s 2B V (N -l,|)r(iV+i) 

F «® = ^tIn-I) • ^l ' 00 )- <15) 
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and 

f °m - - 1 g iS kt B ~ {N ~ hk+4) ' ae|0 - M) - (16) 

respectively. Here, B y (a, 0) = f Q v x^ 1 ^ — x) l3 ~ 1 dx is the incomplete Beta function and T(-) 
defines the Gamma function. The Pochhammer symbol (x) n equals (x) n = F p^"\ 

In principle for K > 3 the exact T$t distribution can be obtained by the standard approach 
of Mellin transform [fT4ll . The resulting density functions may involve the Meijer G-function or 
the Fox H-function [21 J. The results, although are of theoretical interest, appear to be of limited 
usefulness due to their complicated forms. 

Since explicit expression for the distribution of T ST may not be easily obtained for arbitrary 
K, it is more desirable to approximate the distribution by some known distribution, based on 
fitting the first few moments. In this paper, we choose the Beta distribution since it is defined 
on the same support [0, 1] as the random variable T ST . Additional motivation comes from the 
fact that the exact density functions in |fT9l for K = 2 and in II201 for K = 3 hold the same 
polynomial form x l (l — x) j as the Beta density. Accordingly we have 

Proposition 1. For any sensor size K and sample size N, the two-first-moment Beta-approximation 
to the CDF of T ST under % Q is 

where B(a,f3) = p?^^ is the Beta function. The parameters ao and /3q are given by 

M 1 (M 1 -M 2 ) = (l-A4i)(A4i- A4 2 ) 
°° M 2 -(M l ) 2 ' M 2 -{M 1 f 

with 

T(KN)K Kn T K {N + n) 
n T K (N) T(K(N + n)) 1 ; 

where 

T K (N) = tt^ k{k - 1) T(N)T(N - 1) • ■ ■ r(N -K + l). (20) 

The proof of Proposition [T] is in Appendix |Aj Here, ao and (3q are simple functions of the 
sensor size K and sample size N only. Note that asymptotic T ST distributions (w.r.t N) for 
real and complex Wishart matrices can be found in lfT4ll and E3ll respectively. Comparison of 



January 4, 2012 



DRAFT 



ACCEPTED IN IEEE TRANSACTIONS ON COMMUNICATIONS 



8 



approximation accuracy of the asymptotic distribution and Proposition [T] will be performed in 
Section I 



By (14), for any threshold £ the false alarm probability is obtained as 

Pfa(C) = F Sr (C). (21) 

Equivalently for any P fa a threshold can be calculated by numerically inverting Fst(C) 

C = F ST \P fa ). (22) 

B. Detection Probability 

Define the CDF of the random variable T ST under Hi by Gst(u)- Since is related to Gsr(y) 
we have the following result 

Proposition 2. For the case of two sensors K = 2 with arbitrary sample size N, the exact CDF 
of T ST under Hi is given by 

Gs«y) - 1 - C± (3 " ™ ~ +In-1), y€[0,oo) (23) 

^ (2k-l)\ \ai+a 2 J 2 

where the constant 

A{aia 2 ) N {ai + a 2 f-^ 

B(N,N-l)(ai-a 2 ) 2 K } 

and (Tj denotes i-th eigenvalue of the population covariance matrix S ([J]). 

The proof of Proposition [2] is in Appendix [B] Under Hi, exact representations for the dis- 
tribution of T S t exist in the literature, e.g., Theorem 4.1 in E51l and Equation (2.12) in [|26l . 
However, whilst being exact for arbitrary K and N, these representations involve an infinite sum 
of products of a Zonal polynomial and the Meijer G-function, which are difficult to compute. 
The exact distribution of T$t may not be easily obtained in a computable form when K > 2. 
On the other hand, for K = 2 we see that the distribution is a weighted sum of Beta functions. 
Since it is a standard technique in statistics to approximate a sum of Betas by one Beta, we 
extend this to arbitrary K as 

Proposition 3. For any sensor size K and sample size N, the two-first-moment Beta-approximation 
to the CDF ofT S T under Hi is 

G^) ^^ , ye[o,oo). (25) 
±}{ai,pi) 
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The parameters a,\ and 0i are given by 

M(M-M) R (i-M)(M-M) 

ai = ~T7 777T2"' W = 77 777T2 ( 26 ) 

AT 2 - (M) M - (A/1) 2 



_ AKA ^ r(a-Kn)r x (iV + n) (det(S))" 

r x (iv)r(a) (2/) 

where 

a = (iV + n) V / , 6=^f^. (28) 

The proof of Proposition [3] is in Appendix |C} Note that results on asymptotic T ST distribution 
for real Wishart matrices with arbitrary correlation can be found in lfl4ll . which may be gener- 
alized to the complex Wishart case. However, simulations show that the convergence of these 
asymptotic distributions can be very slow w.r.t sample sizes for high SNR. 



By ( |T4| ) the detection probability can be expressed as 

P d (C) = Gsr(C). (29) 
Note that if we further approximate the parameters (a ,f3 ) and (ai,/?i) to their respective nearest 



integer, both (21 ) and (29) reduce to simple polynomial equations in £. Thus the computational 
complexity of threshold calculation becomes quite affordable for on-line implementations. 



For a target P fa we can calculate the resulting threshold ( by (22). With this threshold the 



corresponding P d can be obtained from ( |29| ). The mapping between P fa and P d is the so-called 
receiver operating characteristics. Thus an analytical ROC expression for the ST detection can 
be obtained as 

Pd = GsT(P S T 1 OPfa))- (30) 



C. A Note on Approximation Error 

Based on Weierstrass approximation theorem, any square integrable function on a finite interval 
can be expressed in an orthogonal Jacobi polynomial basis, see e.g. OOl . The proposed two-first- 
moment Beta approximations in Propositions^ and [3] correspond to the simplest form of such an 
approximation, where two first polynomials matching the moments are used. The approximation 
error is related to the higher order polynomials left out from the approximation. The functional 
form of these higher order terms can be found, e.g., in Eq. (6) of OTTl . In light of Eq. (6) in OTTl . 
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the exact Pf a (C) an d Pd(C) can be written as a sum of the proposed Beta approximation and an 
error term e ai ,Pi(C), which equals 

(M n p \ 

K E B p ,U ai+n ~ q (} - ft J , i = 0, 1, (3D 
n=3 p=l (?=! / 



where v4 n and B p q n are some constants. Here, a , (3 are defined in (18) and ati, (3i are defined 



in ( |26| ). Due to the complicated form of the error term pi) , analysis on its behavior seems 
difficult. However, in the most interesting cases of low false alarm probability Pf a (C — * 0) and 
high detection probability P d (£ — > 1), the behavior of the error can be understood. Consider an 



infinitesimal e fulfilling < e <C 1, it follows from (31 ) that the leading order term in e QOi/ 3 (e) 
for low false alarm probability Pf a (e) is proportional to e a ° (when p — q — n in (31 1) and the 
leading order error in e ai ^(1 — e) for high detection probability P d (l — e) is e 13 ' 1 . Typically, the 
values a and (3\ are positive and large. For example, (a , equals (395.4, 17.0), (195.4, 19.1) 
and (95.5, 16.9) for the parameters considered in Figure [3j Figure [4] and Figure [5] respectively. 
Thus, the corresponding error for low P fa and high P d decreases quite fast. 

IV. Numerical Results 

In this section we first validate the derived approximative P ta and P d expressions by Monte- 
Carlo simulations. Then we investigate the performance of ST detection by comparing with 
several detection algorithms in the cases with and without noise uncertainty. The considered 
parameters K and N in this section reflect practical spectrum sensing scenarios. The sample 
size N can be as large as a couple of hundreds whereas the number of sensors K is typically 
less than eight due to physical constraints of the device size. Note that by using the results 
of ED, 11321 . a-priori information on the number of primary users may be exploited to improve 
the detection performance. 

A. False Alarm and Detection Probabilities 

In Figure [T] we compare the Beta approximated (Proposition [T]) and the asymptotic [|23l false 
alarm probabilities as a function of the threshold for various K and N . To quantitatively show 
the approximation accuracy, we calculate average CDF vertical difference^] of the proposed 

2 For a CDF, F(x), and its estimate F(x), the average CDF vertical difference is defined as ^5^7=1 \F( Xi ) ~ F( x i)[j l n ' 
where n is the sampling size. Here, we assume uniform sampling in the support of the distribution with n = 10 7 . 
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and the asymptotic approximations with respect to the exact distribution as resulting from 
simulations. The results, summarized in the caption of Figure [TJ show that the accuracy of 
the Beta approximation is not affected much by K and N, while the accuracy of the asymptotic 
distribution increases with N and decreases with K, as expected. For (K, N) = (4, 20), the Beta 
approximation is an order of magnitude better than the asymptotic result. 

In Figure [2] we plot the derived analytical detection probability versus simulations, assuming 
three simultaneously transmitting primary users (P = 3) with SNR X = — 1 dB, SNR 2 = —3 dB 
and SNR 3 = — 10 dB. Our focus here is detection in the low SNR regime, which is a practical 
and challenging issue in cooperative spectrum sensing. The Beta approximated Pa curves are 



calculated using (29), where the entries of the channel matrix H are independently drawn from a 
standard complex Gaussian distribution corresponding to Rayleigh fading. The channel is fixed 
during sensing and is normalized as = hj/||hj||. Without loss of generality, we set the powers 
of the zero mean Gaussian signal and noise to be 1. Thus, the population covariance matrix £ 
can be explicitly represented as a function of SNRs, i.e, £ = \ K + Y^=i SNRjUjuJ. With the 
same S, the corresponding simulated curve is plotted using 10 5 Monte Carlo runs. From Figure [I] 
it can be observed that the derived P d expression agrees with the simulations well. 



B. Detection Performance 

We compare the detection performance of the spherical test based detector with other known 
detectors by means of the ROC curves. Since the ROC curve shows the achieved detection proba- 
bility as a function of the false alarm probability, it reflects the overall detection performance for a 
given detector. We consider for comparison the cooperative Energy Detector T ED = | |X| lf33l . 
Il34l . In addition, the previously discussed Eigenvalue Ratio based detector T ER = Xi/Xr and 
John's detector T : = J2?=i \7 (E£i \) > which 

are candidate detectors in the presence of 
multiple primary users, are compared to. Optimal detectors for a single primary user, such as 
the LE detector T LE = Ai and the SLE detector T SL e = Ai/tr(R) are considered for comparison 
as well. 

In practical systems we may not have perfect knowledge of the noise power ll34ll . 11351 . 
The noise uncertainty may arise due to interference, noise estimation errors or non-linearity 

3 ||-||f denotes the Frobenius norm. 
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of components (341 . Hence for practical systems, modeling the noise uncertainty is unavoidable. 
The energy detector and the LE detector are subject to noise uncertainty due to dependence of 
the test statistics on the noise power (341 . (H. The SLE, ER, John's and the ST detectors are 
immune to noise uncertainty since the noise powers are replaced by their respective ML estimates 
in constructing the test statistics. Robustness to noise uncertainty is of fundamental importance 
because the uncertainty may severely degrade detection performance, especially at low SNR. If 
/i denotes the degree of noise uncertainty in dB, the actual noise power thus falls in the interval 
Q, = [a 2 1 p, pa 2 ], where p = 1(W 10 . As the same in (341, (351, (EH, in the following plots we 
consider the worst performance degradation due to noise uncertainty, where the noise power 
is pa 2 under Ho and a 2 /p under Hi. This noise uncertainty model is not only of theoretical 
interest [36] but also realistic (341 • For example, in order to protect the primary system and 
guarantee the quality of service for the secondary system, design margins on P ta and P m shall 
be imposed which can be only obtained from the worst case noise uncertainty analysis (361 . 
Note that the considered noise uncertainty levels here, 0.5 dB and 1 dB, are generally realistic 
in spectrum sensing scenarios. For example, it was remarked in (341 that the noise uncertainty 
can be at least 1-2 dB due to limitations of devices only and in (361 the authors considered noise 
uncertainty levels up to 3 dB. 

In order to see a clear picture of the impact of number of primary users P and the noise 
uncertainty /i on the detection performance, we plot various ROC curves assuming different P 
and p. In Figure [3] we show the performance of various detectors in the presence of a single 
primary user with SNRi = —3 dB using K = 4 sensors and iV = 400 samples per sensor. In 
Figure [4] we assume a scenario of three simultaneously transmitting primary users with SNRi = 
-1 dB, SNR 2 = -3 dB and SNR 3 = -10 dB. The number of sensors is chosen to be K = 4, 
and N = 200 samples per sensor are considered. The ST detector works also when the number of 
active primary users P is larger than the number of sensors K. This fact is illustrated in Figure |4} 
where we assume the existence of six primary users with SNR X = dB, SNR 2 = — 1 dB, 
SNR 3 = -3 dB, SNR 4 = -8 dB, SNR 5 = -10 dB and SNR 6 = -22 dB. We consider four 
cooperating sensors K = 4 with sample size N = 100. The analytical approximation of the ROC 
curves are obtained by p0| ), where we assume Rayleigh fading channels which are kept constant 
during sensing. With the same channel realizations, the corresponding numerical ROC is plotted 
as follows. Without loss of generality, we set the noise power a 2 = 1 at the secondary receiver. 
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At noise uncertainty level /x = dB (no uncertainty), /x = 0.5 dB and /x = 1 dB, the noise power 
becomes 1 (no uncertainty), 1.122 and 1.259 respectively under T-L and 1 (no uncertainty), 0.891 
and 0.794 respectively under Hi. For each ROC curve, 5 x 10 6 realizations of the data matrix X 
are drawn from a standard uncorrelated and correlated complex Gaussian distribution under Ho 
and Hi respectively with the corresponding noise powers given above. For each realization, the 
test statistics for the considered detectors are calculated under both hypotheses, from which the 
empirical test statistics distributions are obtained. Using the empirical distributions, the simulated 
ROC curves are constructed by comparing with 1000 equally spaced thresholds in the domain 
of each test statistics. 

Remarks on comparisons with John's detector. We observe in Figure [3] that John's detector 
outperforms the ST detector in the presence of a single primary user (P = 1). In this case, £ 
equals an identity matrix plus a rank one perturbation, where the strength of this perturbation 
is specified by the SNR. With an increased number of primary users (P = 3), the eigenvalues 
of £ become more distinct from each other, where it can be seen from Figure [4] that the ST 
and John's detectors perform almost equally well. When the number of primary users further 
increases (P = 6), the eigenvalues of £ become even more spread. In this case we see from 
Figure [5] that the ST detector outperforms John's detector. To further investigate their relative 
performance, we simulated their detection probabilities as a function of SNR, where the false 
alarm probability is set at 10~ 2 . We assume the presence of two active primary users (P = 2) 
with the difference of their SNR fixed: SNR 2 = SNRi - 2 dB using K = 4 sensors and iV = 50 
samples per sensor. The result is summarized in Table |I} where blue color indicates the higher 
Pd for a given SNR. From Table [I] we observe that when SNRs of the primary users increase 
(eigenvalues of £ become more distinct), ST detector achieves better performance than John's 
detector, though the differences are small. 

The above observations are consistent with the those in fl37), 11381 , fl39), where performance 
comparisons of John's and ST detectors were made. A common conclusion in 0711 . 113811 . [[391 is 
that the relative performance of John's and ST detectors depends on the rank of the perturbation 
matrix[] (in our setting the value of P) and the distinctness of its eigenvalues (in our setting it 

4 The perturbation matrix refers to the difference of the population covariance matrices under Hi and Ho, which, in our case, 
equals E^i7ih i h l t . 
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TABLE I 

Detection probability as a function of SNR 



SNRi in dB 


-1 


-0.5 





0.5 


1 


1.5 


2 


2.5 


3 


Pi of ST detector 


0.3628 


0.4668 


0.5891 


0.7119 


0.8105 


0.8939 


0.9482 


0.9817 


0.9935 


Pi of John's detector 


0.3721 


0.4745 


0.5901 


0.7057 


0.8094 


0.8910 


0.9458 


0.9781 


0.9929 



depends on the SNRs). A complete understanding of the conditions under which John's detector 
outperforms the ST detector, or vice versa, seems difficult partially due to the absence of a 
computable and accurate ROC for John's test. Despite this, more detailed understanding and 
consequently some general recommendations on the use of the tests can be made based on 
results of 071 . Q8l . 09"1 , and our observations. 

. In the case of two sensors, John's and ST detectors achieve the same performance since 
their test statistics, up to a linear transform, are the same when K = 2 OTll . 

• The performance gap of these detectors is not expected to be large 11371 , 11381 , (391, an d 
their asymptotic performance is the same (measured by the Pitman efficiency) 11371 . 

• With one active primary user (rank 1 perturbation matrix), John's detector is preferable 071 . 

• When the number of primary users is large (compared to the number of sensors) with 
not-too-low SNRs (distinct eigenvalues of £), the ST detector is preferable [f37J, |38l, [EH. 

John's test is the so-called locally best invariant test, i.e., it is the most powerful test in the 
neighborhood of H , although the neighborhood in which it is best is small 091 . In our setting, 
this effectively requires that the sum of SNRs is small [39] and the number of primary users is 
not too large (compared to sensor sizes) [38J. Since low SNR detection is of great interest, John's 
test is a viable alternative in the multiple primary users setting, despite that Q8l concluded by 
recommending the use of the ST detector in general. 

Remarks on comparisons with the SLE and the ER detectors: In Figure [3] we observe that 
the SLE detector has the best detection performance among the noise uncertainty free detectors 
considered. Indeed the SLE detector is proved to be optimal for single primary detection under 
the GLRT criterion [6J. When the number of the active primary users is more than one, we see 
from Figure [4] and Figure [5] that the ST detector performs better than the SLE detector. It can be 
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observed that the ST detector always achieves better performance than the ER detector. This is 
intuitively clear by examining their test statistics. For the ER detector, the test statistics depends 
only on the extreme eigenvalues of the sample covariance matrix R, whereas the test statistics 
of the ST detector is a function of all the eigenvalues of R. 

Remarks on comparisons with the LE and the ED detectors: It can be observed from subplot 
(a) of Figure [3} [4] and [5] that, in cases of perfectly estimated noise power, the ED and LE detectors 
almost always outperform the ST detector. However, the performance of ED and LE detectors 
are very sensitive to noise uncertainty which is particularly true when the number of the primary 
users are small. For example, from Figure [3] (b) and Figure [4] (c) we can see that both ED and 
LE detectors fail at /i = 0.5 dB when P = 1 and at /i = 1 dB when P = 3. Note that in 
practice noise uncertainty is always present Il34l thus the superb performance for the ED and 
LE detectors in the subplot (a) of each figure may not be achieved in real world scenarios. 

V. Conclusion and Future Work 

In this paper, we investigated the sensing performance of a multiple primary users detector, 
based on the spherical test. The ST detector estimates whether the covariance matrix differs from 
a matrix proportional to identity. Analytical formulae have been found for the key performance 
metrics of the ST detector. For generic values of the number of sensors K, the formulae are 
based on a two-first-moment Beta-approximation to the corresponding CDFs. The derived results 
are simple to calculate and yield an almost exact fit to simulations. From the simulation setting 
considered, performance gain over several detection algorithms is observed in scenarios with 
noise uncertainty and large number of primary users. 

The key message of this paper is that in the presence of more than one primary users, some 
performance gain may be obtained via the spherical test even without knowing the number of 
primary users. With only one primary user, however, the LE and SLE detectors prove to be 
optimal under the GLRT criterion when the noise power is known and unknown respectively. 
Naturally, one can argue that if some a-priori information on whether P = 1 or P > 1 is 
available, then by switching between the ST and the SLE (or the LE if the noise power is 
known) detection algorithms, advantages of these detectors can be dynamically exploited. The 
a-priori information may be acquired by utilizing the recent advances in estimation algorithm 
for P 0, ll32l . How to analytically capture the above 'estimation-assisted detector' remains as 
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interesting future work. 
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Appendix A 
Distribution of T st Under Ho 

Here we prove Proposition [T] We first derive the exact moments of T ST , which is valid for 
any K and N. Define the random variable Tst by 

det(R) 
(^tr(R)) 

where it can be verified that x E [0, 1]. Under H , the sample covariance matrix R follows an 
uncorrelated complex Wishart distribution Wk (N,Ik) with density function^] 

R~ f ^(det(R)f"V r (- R ), (33) 



X ■= ~ ; .{ K , (32) 



where T K (N) is defined in (20). Since X is a scalar function of matrix argument R, its n-th 
moment can be calculated as 

E[x n ] = ^j—J (det(R)) 7V - x+ "e tr (- R )(tr(R))"^dR (34) 

- ™" + "> / «g^ W R))-^ (35, 
Jnyo T K {N + n) 

-E[(tr(R'))^ n ], (36) 



where the last expectation is with respect to the Wishart matrix R' distributed as Wk (N + n, Ik)- 
The random variable 2tr(R') follows a Chi-square distribution with 2K(N + n) degrees of 
freedom, by using the moment expression for Chi-square distribution Il22ll (Eq. (2.35)), the 
(— Kn)-th moment of tr(R') is obtained as 

mrmf'l = j^fr^y (37) 

5 Since Tst is independent of a 2 , without loss of generality we set a 2 = 1. 
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The n-th moment of X is now 

[X 1 T K (N) T(K(N + n)) ' Mn ' { ^ 
Note that the expression for the exact moments can be also obtained by exploiting the indepen- 
dence between random variables X and tr(R) under Ho lfl5l . 



The first two moments of T ST can be obtained by using (38). For a Beta distribution with 
density function 

' x a °-\l- xf°~\ are [0,1] (39) 



B(a 0} (3 

equaling the first two moments to the moments of T$t we have 



Mi = a ° M 2 = ao[a ° + 1} . (40) 

a + /V («o + /?o)(«o + Po + 1) 



The parameters a and (3 are solved as in (18). This completes the proof. 



Appendix B 
Exact T st Distribution Under Hi for K = 2 

Here we prove Proposition [2j When K = 2, the test statistics T ST reduces to 

x --=T^k? " l0 - 11 ' (41) 

Under H\ the joint density of Ai and A 2 is [|24l 



Ci 



g ct i ct 2 — e CT2 CTi 



(AiA 2 ) 7V_2 (Ai - A 2 ) (42) 



where < A 2 < Ai < oo and C\ = fmbh^/mr^z^] ■ Making a change of variables Ai, A 2 to 
z = Ai, x = (j^+^)2 with Jacobian J = Z ^ L ~^^ - and integrating z out, the density of X reads 

c 2 x N -\\ - x)-v 2 (i - VT^) 2N ~ 2 (VT^ + x-l)x 

({a lX + a 2 (l - v/T^) 2 ) 1 ^ - (^x + ai(l - VT^f) 1 - 2 ") , (43) 
where Co = sttMPt^ \- In order to obtain the CDF of X, we first make a change of variable 

z B(N,N— 1)(<T2— ITlJ ' ° 

u> = \/l — x with Jacobian J = 2w, the density of W becomes 

2(<7! + a 2 ) 1 - 27V C 2W (l - W ) w - 2 (1 + w)^ 2 x 

\ 1-2AT / \ l-2Af N 



1 -w - 1 + — -w . (44) 

O"! + 0"2 / V °T + °"2 
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Now expanding [ 1± in power series of w and then integrating term- wise the 



density of W, the CDF of X is simplified to (|23j) by using the fact that G sr (y) := ¥(Y < y) = 
1 — F(W < y/l — y). This completes the proof. 

Appendix C 
Distribution of T st Under Hi 

Here we prove Proposition [3] We first derive an approximative moments expression of the 
random variable T ST . Under Hi, the sample covariance matrix R follows a correlated complex 
Wishart distribution Wk (N, S) with density function 

R 1 w (det(R)) 7V - i 'e tr (- s " R ). (45) 

T K (N) (det(E))" 

The n-th moment of random variable T S t can be calculated as 



K Kn C 

E[x n ] = = / (det(R)) Ar - /<+n e tr (- s " R )(tr(R))- Xn dR 

r K (iV) (det(E)f 7rm) 



r K (iV) (det(S)) J 
K Kn T K (N + n) f (det(R)) N - K+n e tr (" s " R ) 



(46) 



tr(R))-^ n dR (47) 



r K (7V) (det(S))-" i R ^o r K (iV + n) (det(S)) 



N+n 



r^AT) (det(S)) 

where the last expectation is with respect to R' distributed as Wk {N + n, £). The trace of R' 
can be represented as Il27ll 

K 

Z := tr(R') = (49) 

i=i 

where the random variables 2,2jS are i.i.d Chi-square distributed with 2(N+n) degrees of freedom. 
The exact density function for Z is available when no multiplicity of <7j exists, i.e. cr.; ^ Uj, 
Vz 7^ j ll28l . This effectively requires that £ is full rank or, equivalently, the number of active 
primary users P is greater or equal to the sensor size K. Due to this limitation, we opt for 
the Gamma approximation discussed in ||29l , which is still valid when multiplicities of <jj exist. 
Specifically, for a Gamma distribution with density T ^ ba x a ~ 1 e~%, the mean and variance are 
ab and ab 2 respectively. For the random variable Z, its mean equals 

K K 

e[z\ = ^ E N = ( N + n ) Yl ai (50) 

8=1 1=1 
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and variance equals 

K K 

\[z] = °?Vfe] = (N + n) °l (51) 

i=l i=l 

Fitting the mean and variance of a Gamma random variable to those of Z, we obtain the 



parameters a and b as in (28). With this Gamma approximation, the (— Kn)-\h moment for the 
trace of R' is 

r(o) 

Now the approximate moments of T ST are 

[X 1 \b J T K (N)T(a) -~ Mn - PJj 

Similar to the case under H , for a Beta distribution with parameters a x and (3\, by matching 



its first two moments to H\ and A/2 we obtain {(26]). This completes the proof. 
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Fig. 1. False alarm probability: Beta approximation versus asymptotic distribution 1231 . For (K, N) values (4, 20), (4, 50) and 
(8, 50), the average CDF vertical difference for the Beta approximation is respectively 4.92 x 10~ 8 , 5.02 x 10 -8 , 5.09 x 10~ 8 
and for the asymptotic approximation is respectively 4.14 x 10~ 7 , 4.01 x 10~ 8 , 1.73 x 10~ 7 . 
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Fig. 2. Detection probability (assuming three primary users with SNRi = —1 dB, SNR2 = —3 dB and SNR3 = —10 dB): 
Beta approximation versus simulation. 
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Pta Pfa Pfa 

(a) P = 1; At = dB (b) P = 1; /j, = 0.5 dB (c) P = 1; fi = 1 dB 

Fig. 3. ROC: assuming one primary user with SNRi = —3 dB. The parameters are K — 4 and N = 400. 




(a) P = 3; n = dB (b) P = 3; fj, = 0.5 dB (c) P = 3; fj, = 1 dB 

Fig. 4. ROC: assuming three primary users with SNRi = —1 dB, SNR2 = —3 dB and SNR3 = —10 dB. The parameters are 
K = 4 and N = 200. 




(a) P = 6; n = dB (b) P = 6; fj, = 0.5 dB (c) P = 6; fx = 1 dB 

Fig. 5. ROC: assuming six primary users with SNRi = dB, SNR 2 = -1 dB, SNR 3 = -3 dB, SNR 4 = -8 dB, 
SNR 5 = -10 dB and SNR 6 = -22 dB. The parameters are K = 4 and N = 100. 
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